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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Thomas, Terry L. 



(ii) TITLE OF INVENTION: PRODUCTION OF GAMMA LINOLENIC ACID BY A 
DELTA 6 -DESATURASE 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Scully, Scott, Murphy & Presser 

(B) STREET: 4 0.0 Garden City Plaza 

(C) CITY: Garden City 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 11530 



A (v) COMPUTER READABLE FORM: 

= (A) MEDIUM TYPE: Floppy disk 

3 (B) COMPUTER: IBM PC compatible 

U (C) OPERATING SYSTEM: PC - DOS /MS - DOS 

J (D) SOFTWARE : Patentln Release #1.0, Version #1.25 

J 

n (vi) CURRENT APPLICATION DATA: 

n 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

J(viii) ATTORNEY/ AGENT INFORMATION: 

=s (A) NAME: Presser, Leopold 

f (B) REGISTRATION NUMBER: 19,827 

(C) REFERENCE/DOCKET NUMBER: 8383 Z YXWVU 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (516) 742-4343 

(B) TELEFAX: (516) 742-4366 

(C) TELEX: 230 901 SANS UR 




(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2002.. 3081 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



GCTAGCCACC 


AGTGACGATG 


CCTTGAATTT 


GGCCATTCTG 


ACCCAGGCCC 


GTATTCTGAA 


60 


TCCCCGCATT 


CGCATTGTTA 


ATCGTTTGTT 


CAACCATGCC 


CTGGGTAAAC 


GTTTAGACAC 


120 


CAgCTTGCCA 


GACCACGTTA 


GTTTGAGTGT 


TTCCGCCCTG 


GCGGCCCCGA 


TTTTTTCCTT 


180 


tgSggctttg 


GGCAATCAGG 


CGATCGGGCA 


ATTGCGTTTG 


TTTGACCAGA 


CTTGGCCCAT 


240 


TC^GGAAATT 


GTCATTCACC 


AAGACCATCC 


CTGGCTCAAT 


TTACCCCTGG 


CGGATTTATG 


300 


GGjJrGATCCG 


AGCCGAATGT 


TGATCTATTA 


CCTACCGGCC 


CACAGTGAAA 


CGGATTTAGT 


360 


AG||CGCAGTG 


GTGAATAATT 


TAACGTTGCA 


ATCTGGGGAC 


CATTTAATAG 


TGGGACAAAA 


420 


AC$jcCAACCC 


AAGACCAAAC 


GGCGATCGCC 


TTGGCGCAAA 


TTTTCCAAAC 


TGATTACCAA 


480 


c c¥gc GGG AG 


TATCAGCGGT 


ATGTCCAACA 


GGTGATATGG 


GTGGTGTTGT 


TTTTATTGTT 


540 


GATGATTTTT 


CTGGCCACCT 


TCATCTACGT 


TTCCATTGAT 


CAACATATTG 


CCCCAGTGGA 


600 


CGCGTTGTAT 


TTTTCCGTGG 


GCATGATTAC 


CGGGGCCGGT 


GGCAAGGAAG 


AGGTGGCCGA 


660 


AAAGTCCCCC 


GATATCATCA 


AAGTATTCAC 


AGTGGTGATG 


ATGATCGCCG 


GGGCGGGGGT 


720 


GATTGGTATT 


TGTTATGCCC 


TACTGAATGA 


TTTCATCCTT 


GGCAGTC GCT 


TTAGTCAGTT 


780 


TTTGGATGCG 


GCCAAGTTAC 


CCGATCGCCA 


TCACATCATC 


ATTTGTGGGC 


TGGGGGGAGT 


840 


GAGCATGGCC 


ATTATTGAAG 


AGTTAATTCA 


CCAGGGCCAT 


GAAATTGTGG 


TAATCGAAAA 


900 
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AATCGTTTCT 


TGCATACGGC 


CCGCTCCCTG 


GGGGTGCCCG 


TAATTGTGGA 


960 


r< n Annexe c car 


CTAGAAAGAA 


CGTTGGCCTG 


CGCCAATATC 


AACCGAGCCG 


AAGCCATTGT 


1020 


GGTGGCCACC 


AGCGACGACA 


CCGTTAACTT 


GGAAATTGGC 


CTAACTGCCA 


AGGCGATCGC 


1080 


CCCTAGCCTG 


CCAGTGGTGT 


TGCGTTGCCA 


GGATGCCCAG 


TTTAGCCTGT 


CCCTGCAGGA 


1140 


AGTATTTGAA 


TTTGAAACGG 


TGCTTTGTCC 


GGCGGAATTG 


GCCACCTATT 


CCTTTGCGGC 


1200 


GGCGGCCCTG 


GGGGGCAAAA 


TTTTGGGCAA 


CGGCATGACC 


GATGATTTGC 


TGTGGGTAGC 


1260 


CCTAGCCACC 


TTAATCACTC 


CTAACCATCC 


CTTTGCCGAC 


CAATTGGTTA 


AAATTGCAGC 


1320 




GATTTCGTTC 


CCCTCTATCT 


AGAACGGGGT 


GGCAAAACCA 


TCCATAGCTG 


1380 


GGAATTATTG 


GGTACCCATC 


TCGACTCTGG 


AGACGTGTTG 


TATTTAACCA 


TGCCCGCCAC 


1440 


TGGCCTAGAG 


CAACTTTGGC 


GATCGCCCCG 


TGCCACTGCT 


GATCCTCTGG 


ACTCTTTTTT 


1500 


ggS^ttagcat 


GGGGGGATGG 


AACTCTTGAC 


TCGGCCCAAT 


GGTGATCAAG 


AAAGAAC GCT 


1560 


TTift'CTATGT 


TTAGTATTTT 


TAAGTTAACC 


AACAGCAGAG 


GATAACTTCC 


AAAAGAAATT 


1620 


aaITctcaaaa 


AGTAGCAAAA 


TAAGTTTAAT 


TCATAACTGA 


GTTTTACTGC 


TAAACAGCGG 


1680 


TGCAAAAAAG 


TCAGATAAAA 


TAAAAGCTTC 


ACTTCGGTTT 


TATATTGTGA 


CCATGGTTCC 


1740 


CAGGCATCTG 


CTCTAGGGAG 


TTTTTCCGCT 


GCCTTTAGAG 


AGTATTTTCT 


CCAAGTCGGC 


1800 


in i 

TAACTCCCCC 


ATTTTTAGGC 


AAAATCATAT 


ACAGACTATC 


CCAATATTGC 


CAGAGCTTTG 


1860 


ATQACTCACT 


GTAGAAGGCA 


GACTAAAATT 


CTAGCAATGG 


ACTCCCAGTT 


GGAATAAATT 


1920 


TTTAGTCTCC 


CCCGGCGCTG 


GAGTTTTTTT 


GTAGTTAATG 


GCGGTATAAT 


GTGAAAGTTT 


1980 


TTTATCTATT 


TAAATTTATA 


A ATG CTA 
Met Leu 
1 


ACA GCG GAA 
Thr Ala Glu 
5 


AGA ATT AAA TTT ACC 
Arg He Lys Phe Thr 
10 


2031 



CAG AAA CGG GGG TTT CGT CGG GTA CTA AAC CAA CGG GTG GAT GCC TAC 
Gin Lys Arg Gly Phe Arg Arg Val Leu Asn Gin Arg Val Asp Ala Tyr 
15 20 25 
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TTT GCC GAG CAT GGC CTG ACC CAA AGG GAT AAT CCC TCC ATG TAT CTG 
Phe Ala Glu His Gly Leu Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu 
30 35 40 

AAA ACC CTG ATT ATT GTG CTC TGG TTG TTT TCC GCT TGG GCC TTT GTG 
Lys Thr Leu lie lie Val Leu Trp Leu Phe Ser Ala Trp Ala Phe Val 
45 50 55 

CTT TTT GCT CCA GTT ATT TTT CCG GTG CGC CTA CTG GGT TGT ATG GTT 
Leu Phe Ala Pro Val lie Phe Pro Val Arg Leu Leu Gly Cys Met Val 
60 65 70 

TTG GCG ATC GCC TTG GCG GCC TTT TCC TTC AAT GTC GGC CAC GAT GCC 
Leu Ala lie Ala Leu Ala Ala Phe Ser Phe Asn 'Val Gly His Asp Ala 
75 80 85 90 

AAC CAC AAT GCC TAT TCC TCC AAT CCC CAC ATC AAC CGG GTT CTG GGC 
Asn His Asn Ala Tyr Ser Ser Asn Pro His lie Asn Arg Val Leu Gly 
g 95 100 105 

ATpj ACC TAC GAT TTT GTC GGG TTA TCT AGT TTT CTT TGG CGC TAT CGC 
Mefe: Thr Tyr Asp Phe Val Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg 
: 110 115 120 

CAd AAC TAT TTG CAC CAC ACC TAC ACC AAT ATT CTT GGC CAT GAC GTG 
HiMi Asn Tyr Leu His His Thr Tyr Thr Asn lie Leu Gly His Asp Val 
CP 125 130 135 

Gm ATC CAT GGA GAT GGC GCA GTA CGT ATG AGT CCT GAA CAA GAA CAT 
G1HJ He His Gly Asp Gly Ala Val Arg Met Ser Pro Glu Gin Glu His 

flJ 140 145 . 150 

GTTs GGT ATT TAT CGT TTC CAG CAA TTT TAT ATT TGG GGT TTA TAT CTT 
Val Gly He Tyr Arg Phe Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu 
155 160 165 170 

TTC ATT CCC TTT TAT TGG TTT CTC TAC GAT GTC TAC CTA GTG CTT AAT 
Phe lie Pro Phe Tyr Trp Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn 
175 180 185 

AAA GGC AAA TAT CAC GAC CAT AAA ATT CCT CCT TTC CAG CCC CTA GAA 
Lys Gly Lys Tyr His Asp His Lys He Pro Pro Phe Gin Pro Leu Glu 
190 195 200 

TTA GCT AGT TTG CTA GGG ATT AAG CTA TTA TGG CTC GGC TAC GTT TTC 
Leu Ala Ser Leu Leu Gly He Lys Leu Leu Trp Leu Gly Tyr Val Phe 
205 210 215 

GGC TTA CCT CTG GCT CTG GGC TTT TCC ATT CCT GAA GTA TTA ATT GGT 
Gly Leu Pro Leu Ala Leu Gly Phe Ser He Pro Glu Val Leu He Gly 
220 225 230 

GCT TCG GTA ACC TAT ATG ACC TAT GGC ATC GTG GTT TGC ACC ATC TTT 
Ala Ser Val Thr Tyr Met Thr Tyr Gly He Val Val Cys Thr He Phe 
235 240 245 250 
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ATG CTG GCC CAT GTG TTG GAA TCA ACT GAA TTT CTC ACC CCC GAT GGT 

Met Leu Ala His Val Leu Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly 
255 260 265 

GAA TCC GGT GCC ATT GAT GAC GAG TGG GCT ATT TGC CAA ATT CGT ACC 

Glu Ser Gly Ala He Asp Asp Glu Trp Ala He Cys Gin He Arg Thr 
270 * ~ 275 280 

ACG GCC AAT TTT GCC ACC AAT AAT CCC TTT TGG AAC TGG TTT TGT GGC 

Thr Ala Asn Phe Ala Thr Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly 
285 290 295 

GGT TTA AAT CAC CAA GTT ACC CAC CAT CTT TTC CCC AAT ATT TGT CAT 

Gly Leu Asn His Gin Val Thr His His Leu Phe Pro Asn He Cys His 
300 305 310 

ATT CAC TAT CCC CAA TTG GAA AAT ATT ATT AAG GAT GTT TGC CAA GAG 

He His Tyr Pro Gin Leu Glu Asn He He Lys Asp Val Cys Gin Glu 
3ft5 320 325 330 

tJIt GGT GTG GAA TAT AAA GTT TAT CCC ACC TTC AAA GCG GCG ATC GCC 

Pfee Gly Val Glu Tyr Lys Val Tyr Pro Thr Phe Lys Ala Ala He Ala 

2 335 340 345 

TCT AAC TAT CGC TGG CTA GAG GCC ATG GGC AAA GCA TCG TGACATTGCC 

sir Asn Tyr Arg Trp Leu Glu Ala Met Gly Lys Ala Ser 
m 350 355 360 



TjfpGGATTGA 


AGCAAAATGG 


CAAAATCCCT 


CGTAAATCTA 


TGATCGAAGC 


CTTTCTGTTG 


3148 


CpSCGCCGACC 


AAATCCCCGA 


TGCTGACCAA 


AGGTTGATGT 


TGGCATTGCT 


CCAAACCCAC 


3208 


tStgaggggg 


TTCATTGGCC 


GCAGTTTCAA 


GCTGACCTAG 


GAGGCAAAGA 


TTGGGTGATT 


3268 


TTGCTCAAAT 


CCGCTGGGAT 


ATTGAAAGGC 


TTCACCACCT 


TTGGTTTCTA 


CCCTGCTCAA 


3328 


TGGGAAGGAC 


AAACCGTCAG 


AATTGTTTAT 


TCTGGTGACA 


CCATCACCGA 


CCCATCCATG 


3388 


TGGTCTAACC 


CAGCCCTGGC 


CAAGGCTTGG 


ACCAAGGCCA 


TGCAAATTCT 


CCACGAGGCT 


3448 


AGGCCAGAAA 


AATTATATTG 


GCTCCTGATT 


TCTTCCGGCT 


ATCGCACCTA 


CCGATTTTTG 


3508 


AGCATTTTTG 


CCAAGGAATT 


CTATCCCCAC 


TATCTCCATC 


CCACTCCCCC 


GCCTGTACAA 


3568 


AATTTTATCC 


ATCAGCTAGC 










3588 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Leu Thr Ala Glu Arg He Lys Phe Thr Gin Lys Arg Gly Phe Arg 
1 5. 10 15 

Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 
20 25 30 

Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu He lie Val 
35 40 45 

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 
50 55 60 

Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 
65 " 70 75 80 

Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 
^ 85 90 95 

Set Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 
100 105 110 

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
CP 115 120 125 

THr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Gly 
m 130 135 140 

ma Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
lfflB 150 155 160 

Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu Phe He Pro Phe Tyr Trp 
165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 
180 185 190 

His Lys He Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Le,u Gly 
195 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 
210 215 ~ 220 

Gly Phe Ser He Pro Glu Val Leu He Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly He Val Val Cys Thr He Phe Met Leu Ala His Val Leu 
245 250 255 

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 
260 265 270 



Asp Glu Trp Ala He Cys Gin He Arg Thr Thr Ala Aan Phe Ala Thr 
275 280 285 

Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 
290 295 300 

Thr His His Leu Phe Pro Asn He Cys His He His Tyr Pro Gin Leu 
305 310 315 320 

Glu Asn He He Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 
325 330 335 

Val Tyr Pro Thr Phe Lys Ala Ala He Ala Ser Asn Tyr Arg Trp Leu 
340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 

<2jp INFORMATION FOR SEQ ID N0:3: 

Jj (i) SEQUENCE CHARACTERISTICS: 

" : (A) LENGTH: 1884 base pairs 

t (B) TYPE: nucleic acid 

I: (C) STRANDEDNESS : both 

y - (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

fU (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AG<f9?TCACTT CGGTTTTATA TTGTGACCAT GGTTCCCAGG CATCTGCTCT AGGGAGTTTT 60 

TCCGCTGCCT TTAGAGAGTA TTTTCTCCAA GTCGGCTAAC TCCCCCATTT TTAGGCAAAA 120 

TCATATACAG ACTATCCCAA TATTGCCAGA GCTTTGATGA CTCACTGTAG AAGGCAGACT 180 

AAAATTCTAG CAATGGACTC CCAGTTGGAA TAAATTTTTA GTCTCCCCCG GCGCTGGAGT 240 

TTTTTTGTAG TTAATGGCGG TATAATGTGA AAGTTTTTTA TCTATTTAAA TTTATAAATG 300 

CTAACAGCGG AAAGAATTAA ATTTACCCAG AAACGGGGGT TTCGTCGGGT ACTAAACCAA 360 

CGGGTGGATG CCTACTTTGC CGAGCATGGC CTGACCCAAA GGGATAATCC CTCCATGTAT 42 0 

CTGAAAACCC TGATTATTGT GCTCTGGTTG TTTTCCGCTT GGGCCTTTGT GCTTTTTGCT 4 80 

CCAGTTATTT TTCCGGTGCG CCTACTGGGT TGTATGGTTT TGGCGATCGC CTTGGCGGCC 54 0 

TTTTCCTTCA ATGTC GGCC A CGATGCCAAC CACAATGCCT ATTCCTCCAA TCCCCACATC 600 

AACCGGGTTC TGGGCATGAC CTACGATTTT GTCGGGTTAT CTAGTTTTCT TTGGCGCTAT 660 

CGCCACAACT ATTTGCACCA CACCTACACC AATATTCTTG GCCATGACGT GGAAATCCAT 720 
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GGAGATGGCG 


CAGTACGTAT 


GAGTCCTGAA 


CAAGAACATG 


TTGGTATTTA 


TCGTTTCCAG 


780 


CAATTTTATA 


TTTGGGGTTT 


ATATCTTTTC 


ATTCCCTTTT 


ATTGGTTTCT 


CTACGATGTC 


840 


TACCTAGTGC 


TTAATAAAGG 


CAAATATCAC 


GACCATAAAA 


TTCCTCCTTT 


CCAGCCCCTA 


900 


GAATTAGC T A 


GTTTGCTAGG 


GATTAAGCTA 


TTATGGCTCG 


GCTACGTTTT 


CGGCTTACCT 


960 


CTGGCTCTGG 


GCTTTTCCAT 


TCCTGAAGTA 


TTAATTGGTG 


CTTCGGTAAC 


CTATATGACC 


1020 


TATGGCATCG 


TGGTTTGCAC 


CATCTTTATG 


CTGGCCCATG 


TGTTGGAATC 


AACTGAATTT 


1080 


CTCACCCCCG 


ATGGTGAATC 


CGGTGCCATT 


GATGACGAGT 


GGGCTATTTG 


CCAAATTCGT 


1140 


ACCACGGCCA 


ATTTTGC C AC 


CAATAATCCC 


TTTTGGAACT 


GGTTTTGTGG 


CGGTTTAAAT 


1200 


CACCAAGTTA 


CCCACCATCT 


TTTCCCCAAT 


ATTTGTCATA 


TTCACTATCC 


CCAATTGGAA 


1260 


AATATTATTA 


AGGATGTTTG 


CCAAGAGTTT 


GGTGTGGAAT 


ATAAAGTTTA 


TCCCACCTTC 


1320 


AA|icGGCGA 


TCGCCTCTAA 


CTATCGCTGG 


CTAGAGGCCA 


TGGGCAAAGC 


ATCGTGACAT 


1380 


tgSttggGa 


TTGAAGCAAA 


ATGGCAAAAT 


CCCTCGTAAA 


TCTATGATCG 


AAGCCTTTCT 


1440 


GTTGCCCGCC 


GACCAAATCC 


CCGATGCTGA 


CCAAAGGTTG 


ATGTTGGCAT 


TGCTCCAAAC 


1500 


ccaStttgag 


GGGGTTCATT 


GGCCGCAGTT 


TCAAGCTGAC 


CTAGGAGGCA 


AAGATTGGGT 


1560 


GATTTTGCTC 


AAATCCGCTG 


GGATATTGAA 


AGGCTTCACC 


ACCTTTGGTT 


TCTACCCTGC 


1620 


TCAATGGGAA 


GGACAAACCG 


TCAGAATTGT 


TTATTCTGGT 


GACACCATCA 


CCGACCCATC 


1680 


CATgTGGTCT 


AACCCAGCCC 


TGGCCAAGGC 


TTGGACCAAG 


GCCATGCAAA 


TTCTCCACGA 


1740 


GGCTAGGCCA 


GAAAAATTAT 


ATTGGCTCCT 


GATTTCTTCC 


GGCTATCGCA 


CCTACCGATT 


1800 


TTTGAGCATT 


TTTGCCAAGG 


AATTCTATCC 


CCACTATCTC 


CATCCCACTC 


CCCCGCCTGT 


1860 


ACAAAATTTT 


ATCCATCAGC 


TAGC 








1884 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
AATATCTGCC TACCCTCCCA AAGAGAGTAG TCATTTTTCA TCAATGGCTG CTCAAATCAA 60 
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GAAATACATT 


ACCTCAGATG 


AACTCAAGAA 


CC AC GAT AAA 


CCCGGAGATC 


TATGGATCTC 


120 


GATTCAAGGG 


aaagcctatIg 


ATGTTTCGGA 


TTGGGTGAAA 


GACCATCCAG 


GTGGCAGCTT 


180 


TCCCTTGAAG 


AGTCTTGCTG 


GTCAAGAGGT 


AACTGATGCA 


TTTGTT GC AT 


TCCATCCTGC 


240 


CTCTACATGG 


AAGAATCTTG 


ATAAGTTTTT 


CACTGGGTAT 


TATCTTAAAG 


ATTACTCTGT 


300 


TTCTGAGGTT 


TCTAAAGATT 


ATAGGAAGCT 


TGTGTTTGAG 


TTTTCTAAAA 


TGGGTTTGTA 


360 


TGACAAAAAA 


GGTCATATTA 


TGTTTGCAAC 


TTTGTGCTTT 


ATAGCAATGC 


TGTTTGCTAT 


420 


GAGTGTTTAT 


GGGGTTTTGT 


TTTGTGAGGG 


TGTTTTGGTA 


CATTTGTTTT 


CTGGGTGTTT 


480 


GATGGGGTTT 


CTTTGGATTC 


AGAGTGGTTG 


GATTGGACAT 


GATGCTGGGC 


ATTATATGGT 


540 


AGTGTCTGAT 


TCAAGGCTTA 


ATAAGTTTAT 


GGGTATTTTT 


GCTGCAAATT 


GTCTTTCAGG 


600 


AAfAAGTATT 


GGTTGGTGGA 


AATGGAACCA 


TAATGCACAT 


CACATTGCCT 


GTAATAGCCT 


660 


tgS^tatgac 


CCTGATTTAC 


AAT AT AT AC C 


ATTCCTTGTT 


GTGTCTTCCA 


AGTTTTTTGG 


720 


TTGACTCACC 


TCTCATTTCT 


ATGAGAAAAG 


GTTGACTTTT 


GACTCTTTAT 


CAAGATTCTT 


780 


TGilAAGTTAT 


CAACATTGGA 


CATTTTACCC 


TATTATGTGT 


GCTGCTAGGC 


TCAATATGTA 


840 


TGTACAATCT 


CTCATAATGT 


TGTTGACCAA 


GAGAAATGTG 


TCCTATCGAG 


CTCAGGAACT 


900 


ctugggatgc 


CTAGTGTTCT 


CGATTTGGTA 


CCCGTTGCTT 


GTTTCTTGTT 


TGCCTAATTG 


960 


ggp^gaaaga 


ATTATGTTTG 


TTATTGCAAG 


TTTATCAGTG 


ACTGGAATGC 


AACAAGTTCA 


1020 


GTftTCCTTG 


AACCACTTCT 


CTTCAAGTGT 


TTATGTTGGA 


AAGCCTAAAG 


GGAATAATTG 


1080 


GTTTGAGAAA 


CAAACGGATG 


GGACACTTGA 


CATTTCTTGT 


CCTCCTTGGA 


TGGATTGGTT 


1140 


TCATGGTGGA 


TTGCAATTCC 


AAATTGAGCA 


TCATTTGTTT 


CCCAAGATGC 


CTAGATGCAA 


1200 


CCTTAGGAAA 


ATCTCGCCCT 


ACGTGATCGA 


GTTATGCAAG 


AAACATAATT 


TGCCTTACAA 


1260 


TTATGCATCT 


TTCTCCAAGG 


CCAATGAAAT 


GACACTCAGA 


ACATTGAGGA 


ACACAGCATT 


1320 


GCAGGCTAGG 


GATATAACCA 


AGCCGCTCCC 


GAAGAATTTG 


GTATGGGAAG 


CTCTTCACAC 


1380 


TCATGGTTAA 


AATTACCCTT 


AGTTCATGTA 


ATAATTTGAG 


ATTATGTATC 


TCCTATGTTT 


1440 


GTGTCTTGTC 


TTGGTTCTAC 


TTGTTGGAGT 


CATTGCAACT 


TGTCTTTTAT 


GGTTTATTAG 


1500 


ATGTTTTTTA 


ATATATTTTA 


GAGGTTTTGC 


TTTCATCTCC 


ATTATTGATG 


AATAAGGAGT 


1560 


TGCATATTGT 


CAATTGTTGT 


GCTCAATATC 


TGATATTTTG 


GAATGTACTT 


TGTACCACTG 


1620 


TGTTTTCAGT 


TGAAGCTCAT 


GTGTACTTCT 


ATAGACTTTG 


TTTAAATGGT 


TATGTCATGT 


1680 


TATTT 












1685 




(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Gin He Lys Lys Tyr He Thr Ser Asp Glu Leu Lys Asn 
1 5 10 15 

His Asp Lys Pro Gly Asp Leu Trp He Ser He Gin Gly Lys Ala Tyr 
20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
% 35 40 45 

° Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 

fy 50 55 60 

H Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 

-1 65 70 75 80 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu 
M= 85 90 95 

n val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His He 
U 100 105 HO 

U Met Phe Ala Thr Leu Cys Phe He Ala Met Leu Phe Ala Met Ser Val 
115 120 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cvs Leu Met Gly Phe Leu Trp He Gin Ser Gly Trp He Gly His Asp 
145 150 155 160 

Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
165 170 175 

Gly He Phe Ala Ala Asn Cys Leu Ser Gly He Ser He Gly Trp Trp 
180 185 190 

Lys Trp Asn His Asn Ala His His He Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 

Asp Pro Asp Leu Gin Tyr He Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 
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Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 235 240 

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 
245 250 255 

He Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu He Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser He Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg He Met Phe Val He Ala Ser Leu Ser Val Thr 
305 310 315 320 

Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
325 330 335 

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
340 345 350 

Gly Thr Leu Asp He Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

Gly Ser Gin Phe Gin He Glu His His Leu Phe Pro Lys Met Pro Arg 
370 375 380 

Cys Asn Leu Arg Lys He Ser Pro Tyr Val He Glu Leu Cys Lys Lys 
385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 410 415 

Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp He Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr His Gly 
435 440 445 



INFORMATION FOR SEQ ID NO : 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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Trp He Gly His Asp Ala Gly His 
1 5 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

Asn Val Gly His Asp Ala Asn His 
1 5 

(2)OINFORMATION FOR SEQ ID NO : 8 : 

pj (i) SEQUENCE CHARACTERISTICS: 

Ci (A) LENGTH: 8 amino acids 

=i (B) TYPE: amino acid 

§j (D) TOPOLOGY: linear 

J, (ii) MOLECULE TYPE: Peptide 

?(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

H= Val Leu Gly His Asp Cys Gly His 
g 1 5 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (peptide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S 

Val He Ala His Glu Cys Gly His 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val lie Gly His Asp Cys Ala His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 
?5 (B) TYPE: amino acid 

m (D) TOPOLOGY: linear 

rl(ii) MOLECULE TYPE: Peptide 
'(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Val Val Gly His Asp Cys Gly His 

J 1 5 

(2) j INFORMATION FOR SEQ ID NO: 12: 

Q 

N= (i) SEQUENCE CHARACTERISTICS: 

• (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

His Asn Ala His His 

1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (peptide) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

His Asn Tyr Leu His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

His Arg Thr His His 
M 1 ~ 5 

(2) "INFORMATION FOR SEQ ID NO: 15: 
1 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 
W (b) TYPE: amino acid 

ff 1 (D) TOPOLOGY: linear 

? 1 (ii) MOLECULE TYPE: Peptide 

tU 

rU(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Q His Arg Arg His His 

H 1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

His Asp Arg His His 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Asp Gin His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

h (A) LENGTH: 5 amino acids 

Q (B) TYPE: amino acid 

nj (D) TOPOLOGY: linear 

Ct (ii) MOLECULE TYPE: Peptide 

m (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

His Asp His His His 

y 1 5 

(2 ; £ INFORMATION FOR SEQ ID NO: 19: 
3 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

His Asn His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Phe Gin He Glu His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

His Gin Val Thr His His 
,. 1 5 
( 2 jg INFORMATION FOR SEQ ID NO: 22: 

W (i) SEQUENCE CHARACTERISTICS: 

AO 

H (A) LENGTH: 5 amino acids 

Ul (B) TYPE: amino acid 

OR (D) TOPOLOGY: linear 

S-M'ii) MOLECULE TYPE: Peptide 

nj 

fU (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22 

y= 

hs His Val He His His 
C 1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

His Val Ala His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

His lie Pro His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Uii) MOLECULE TYPE: Peptide 

ffxi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

«His Val Pro His His 

qi 5 

2) INFORMATION FOR SEQ ID NO: 26: 

''t.. (i) SEQUENCE CHARACTERISTICS: 

y (A) LENGTH: 1702 base pairs 

Jy (B) TYPE: nucleic acid 

H= (C) STRANDEDNESS: both 

O (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 48.. 1406 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 48.. 1406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CCCCAAAAAT TTTCATTGTT CTCCATCTGG ACCACAGCAT CCACACA ATG GAG GGC 56 

Met Glu Gly 
1 

GAA GCT AAG AAG TAT ATC ACG GCG GAG GAC CTC CGC CGC CAC AAC AAG 104 
Glu Ala Lys Lys Tyr lie Thr Ala Glu Asp Leu Arg Arg His Asn Lys 
5 10 15 

TCC GGC GAT CTC TGG ATC TCC ATC CAG GGC AAG GTC TAC GAC TGC TCT 152 



Ser Gly Asp Leu Trp He Ser He Gin Gly Lys Val Tyr Asp Cys Ser 
20 25 30 35 

CGG TGG GCG GCG GAG CAC CCC GGC GGC GAG GTC CCG CTC CTC AGT CTG 
Arg Trp Ala Ala Glu His Pro Gly Gly Glu Val Pro Leu Leu Ser Leu 
40 45 50 

GCC GGC CAG GAC GTC ACC GAC GCC TTC ATT GCG TAC CAC CCG GGC ACG 
Ala Gly Gin Asp Val Thr Asp Ala Phe He Ala Tyr His Pro Gly Thr 
55 60 65 

GCG TGG CGG CAT CTG GAT CCG CTC TTC ACC GGC TAC TAC TAC CTC AAG 
Ala Trp Arg His Leu Asp Pro Leu Phe Thr Gly Tyr Tyr Tyr Leu Lys 
70 75 80 

GAC TTC GAA GTG TCG GAG ATC TCC AAG GAC TAC CGG AGG CTT TTG AAC 
Asp Phe Glu Val Ser Glu He Ser Lys Asp Tyr Arg Arg Leu Leu Asn 
85 90 95 

GAG ATG TCG CGG TCC GGG ATC TTC GAG AAG AAG GGC CAC CAC ATC ATG 
Gluglet Ser Arg Ser Gly He Phe Glu Lys Lys Gly His His He Met 
100m 105 , HO 115 

TGG ACG TTC GTC GGC GTT GCG GTC ATG ATG GCG GCA ATC GTC TAC GGC 
Trp . : Thr Phe Val Gly Val Ala Val Met Met Ala Ala He Val Tyr Gly 
%i . 120 125 130 

GTG : CTG GCG TCG GAG TCC GTC GGA GTT CAC ATG CTC TGC GGC GCA CTG 
Val Leu Ala Ser Glu Ser Val Gly Val His Met Leu Cys Gly Ala Leu 

Ljj 135 140 145 

CTG GGC TTG CTG TGG ATC CAA GCC GCG TAT GTG GGC CAT GAC TCC GGC 
Leu^ly Leu Leu Trp He Gin Ala Ala Tyr Val Gly His Asp Ser Gly 
N 150 155 160 

CAT TAC CAG GTG ATG CCA ACC CGT GGA TAC AAC AGA ATC ACG CAA CTC 
His Tyr Gin Val Met Pro Thr Arg Gly Tyr Asn Arg He Thr Gin Leu 
165 170 175 

ATA GCA GGC AAC ATC CTA ACC GGA ATC AGC ATC GCG TGG TGG AAG TGG 
He Ala Gly Asn He Leu Thr Gly He Ser He Ala Trp Trp Lys Trp 
180 185 190 195 

ACC CAC AAC GCC CAC CAC CTC GCC TGC AAC AGC CTC GAC TAC GAC CCC 
Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr Asp Pro 
200 205 210 

GAC CTC CAG CAC ATC CCC GTA TTC GCC GTC TCC ACC CGA CTC TTC AAC 
Asp Leu Gin His He Pro Val Phe Ala Val Ser Thr Arg Leu Phe Asn 
215 220 225 

TCC ATC ACC TCG GTC TTC TAT GGC CGA GTC CTG AAA TTC GAC GAA GTG 
Ser He Thr Ser Val Phe Tyr Gly Arg Val Leu Lys Phe Asp Glu Val 
230 235 240 
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GCA CGG TTC CTA GTC AGC TAC CAG CAC TGG ACC TAC TAC CCG GTC ATG 
Ala Arg Phe Leu Val Ser Tyr Gin His Trp Thr Tyr Tyr Pro Val Met 
245 250 255 

ATC TTC GGC CGA GTC AAC CTC TTC ATC CAG ACC TTT TTA TTG CTC CTC 
lie Phe Gly Arg Val Asn Leu Phe lie Gin Thr Phe Leu Leu Leu Leu 
260 265 270 275 

ACC AGG CGC GAC GTC CCT GAC CGC GCT CTA AAC TTA ATG GGT ATC GCG 
Thr Arg Arg Asp Val Pro Asp Arg Ala Leu Asn Leu Met Gly He Ala 
280 285 290 

GTT TTC TGG ACG TGG TTC CCG CTC TTC GTA TCT TGT CTC CCG AAC TGG 
Val Phe Trp Thr Trp Phe Pro Leu Phe Val Ser Cys Leu Pro Asn Trp 
295 300 305 

CCT GAA CGG TTC GGG TTC GTC CTC ATC AGC TTT GCG GTC ACG GCG ATC 
Pro Glu Arg Phe Gly Phe Val Leu He Ser Phe Ala Val Thr Ala He 
U 310 315 320 

CAGppAC GTC CAG TTC ACG CTC AAC CAC TTC TCC GGC GAC ACA TAC GTG 
Gln:;:His Val Gin Phe Thr Leu Asn His Phe Ser Gly Asp Thr Tyr Val 
f 25 330 335 

GGC;£CC CCC AAG GGC GAC AAC TGG TTC GAG AAG CAG ACG AAA GGG ACG 
Glyjfro Pro Lys Gly Asp Asn Trp Phe Glu Lys Gin Thr Lys Gly Thr 
340^ 345 350 355 

ATC GAT ATC ACG TGC CCA CCG TGG ATG GAC TGG TTC TTT GGT GGG CTG 
Ile=Asp He Thr Cys Pro Pro Trp Met Asp Trp Phe Phe Gly Gly Leu 
IP. 360 365 370 

CAGC3TC CAG TTG GAG CAC CAC TTG TTC CCT AGG CTG CCG CGT GGG CAG 
Gin Phe Gin Leu Glu His His Leu Phe Pro Arg Leu Pro Arg Gly Gin 
375 380 385 

CTT AGG AAG ATT GCG CCC TTG GCT CGG GAC TTG TGT AAG AAG CAC GGG 
Leu Arg Lys He Ala Pro Leu Ala Arg Asp Leu Cys Lys Lys His Gly 
390 395 400 

ATG CCG TAT AGG AGC TTC GGG TTT TGG GAC GAC GCT AAT GTC AGG ACA 
Met Pro Tyr Arg Ser Phe Gly Phe Trp Asp Asp Ala Asn Val Arg Thr 
405 410 415 

ATT CGG ACG CTG AGG GAT GCG GCG GTT CAG GCG CGT GAC CTT AAT TCG 
He Arg Thr Leu Arg Asp Ala Ala Val Gin Ala Arg Asp Leu Asn Ser 
42 0 425 430 435 

GCC CCG TGC CCT AAG AAA CTT GGG TAT GGG GAA GCT TAT AAC ACC CAT 
Ala Pro Cys Pro Lys Lys Leu Gly Tyr Gly Glu Ala Tyr Asn Thr His 
440 445 450 

GGT TGA TTGTGGTTTT GTGTTGTGGG TTGGAGGATC TTCTTATTAT TGATTTATGT 
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CCACAATATT GAACTGAATA ACCATGGAAG GC ACT AC GTT CAGCTTAACT TTGCTTAACT 1516 

TTGCTAGCTG GTTGCGTTCC CTTGTTGGGG GC AAAGTGCA GTATTTATTT TCTTATCCCA 1576 

TGTACTTTTT GATTATTGTT CTTATTCGTA TCATAAATAA TTTATTATTG ATTAATTTTT 1636 

GTTGTAGTTG GGTGTC TATA GCAAGTTTAT AATAC TGAGA TATATTTTTT TGGTAAAAAA 1696 

AAAAAA 1702 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

M (ii) MOLECULE TYPE: protein 

~ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met^iflu Gly Glu Ala Lys Lys Tyr lie Thr Ala Glu Asp Leu Arg Arg 

1 :1 5 10 15 

His nan Lys Ser Gly Asp Leu Trp He Ser He Gin Gly Lys Val Tyr 
m 20 25 30 

Asp Gys Ser Arg Trp Ala Ala Glu His Pro Gly Gly Glu Val Pro Leu 
m 35 40 45 

Leu ier Leu Ala Gly Gin Asp Val Thr Asp Ala Phe He Ala Tyr His 
O 50 55 60 

Pro Gly Thr Ala Trp Arg His Leu Asp Pro Leu Phe Thr Gly Tyr Tvr 
65 70 75 80 

Tyr Leu Lys Asp Phe Glu Val Ser Glu He Ser Lys Asp Tyr Arg Arg 
85 9.0 95 

Leu Leu Asn Glu Met Ser Arg Ser Gly He Phe Glu Lys Lys Gly His 
100 105 110 

His He Met Trp Thr Phe Val Gly Val Ala Val Met Met Ala Ala He 
115 120 125 

Val Tyr Gly Val Leu Ala Ser Glu Ser Val Gly Val His Met Leu Cys 
130 135 140 

Gly Ala Leu Leu Gly Leu Leu Trp He Gin Ala Ala Tyr Val Glv His 
145 150 155 160 

Asp Ser Gly His Tyr Gin Val Met Pro Thr Arg Gly Tyr Asn Arg He 
165 170 175 
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Thr Gin Leu He Ala Gly Asn He Leu Thr Gly He Ser He Ala Trp 
180 185 190 

Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp 
195 200 205 

Tyr Asp Pro Asp Leu Gin His He Pro Val Phe Ala Val Ser Thr Arg 
210 215 220 

Leu Phe Asn Ser He Thr Ser Val Phe Tyr Gly Arg Val Leu Lys Phe 
225 230 235 240 

Asp Glu Val Ala Arg Phe Leu Val Ser Tyr Gin His Trp Thr Tyr Tyr 
245 250 255 

Pro Val Met He Phe Gly Arg Val Asn Leu Phe He Gin Thr Phe Leu 
260 265 270 

Leu -Leu Leu Thr Arg Arg Asp Val Pro Asp Arg Ala Leu Asn Leu Met 
O 275 280 285 

Q 

Gly He Ala Val Phe Trp Thr Trp Phe Pro Leu Phe Val Ser Cys Leu 
y3290 295 300 

M 

Prbcksn Trp Pro Glu Arg Phe Gly Phe Val Leu He Ser Phe Ala Val 

305pi 310 315 320 

Thr[Ala He Gin His Val Gin Phe Thr Leu Asn His Phe Ser Gly Asp 
», 325 330 335 

Thr Tyr Val Gly Pro Pro Lys Gly Asp Asn Trp Phe Glu Lys Gin Thr 
£ 340 345 350 

Lys^Gly Thr He Asp He Thr Cys Pro Pro Trp Met Asp Trp Phe Phe 
355 360 365 

Gly Gly Leu Gin Phe Gin Leu Glu His His Leu Phe Pro Arg Leu Pro 
370 375 ^ 380 

Arg Gly Gin Leu Arg Lys He Ala Pro Leu Ala Arg Asp Leu Cys Lys 
385 390 395 400 

Lys His Gly Met Pro Tyr Arg Ser Phe Gly Phe Trp Asp Asp Ala Asn 
405 410 415 

Val Arg Thr He Arg Thr Leu Arg Asp Ala Ala Val Gin Ala Arg Asp 
420 425 430 

Leu Asn Ser Ala Pro Cys Pro Lys Lys Leu Gly Tyr Gly Glu Ala Tyr 
435 440 445 

Asn Thr His Gly * 
450 



